CDS

Accession Number TCMCG075C07464
gbkey CDS
Protein Id XP_007045132.1
Location complement(join(35205923..35206247,35208268..35208347,35208493..35208695,35209025..35209112))
Gene LOC18609772
GeneID 18609772
Organism Theobroma cacao

Protein

Length 231aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007045070.2
Definition PREDICTED: homeobox-leucine zipper protein HOX3 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description homeobox-leucine zipper protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03000        [VIEW IN KEGG]
KEGG_ko ko:K09338        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0001067        [VIEW IN EMBL-EBI]
GO:0003674        [VIEW IN EMBL-EBI]
GO:0003676        [VIEW IN EMBL-EBI]
GO:0003677        [VIEW IN EMBL-EBI]
GO:0003700        [VIEW IN EMBL-EBI]
GO:0005488        [VIEW IN EMBL-EBI]
GO:0006355        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0009889        [VIEW IN EMBL-EBI]
GO:0010468        [VIEW IN EMBL-EBI]
GO:0010556        [VIEW IN EMBL-EBI]
GO:0019219        [VIEW IN EMBL-EBI]
GO:0019222        [VIEW IN EMBL-EBI]
GO:0031323        [VIEW IN EMBL-EBI]
GO:0031326        [VIEW IN EMBL-EBI]
GO:0044212        [VIEW IN EMBL-EBI]
GO:0050789        [VIEW IN EMBL-EBI]
GO:0050794        [VIEW IN EMBL-EBI]
GO:0051171        [VIEW IN EMBL-EBI]
GO:0051252        [VIEW IN EMBL-EBI]
GO:0060255        [VIEW IN EMBL-EBI]
GO:0065007        [VIEW IN EMBL-EBI]
GO:0080090        [VIEW IN EMBL-EBI]
GO:0097159        [VIEW IN EMBL-EBI]
GO:0140110        [VIEW IN EMBL-EBI]
GO:1901363        [VIEW IN EMBL-EBI]
GO:1903506        [VIEW IN EMBL-EBI]
GO:2000112        [VIEW IN EMBL-EBI]
GO:2001141        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCGGTTTTACCCACCGGCTCTTCTAACTTGGAGTTGACAATATCTGTTCCCGGCTTCTCTTCTTCCCCTTCTCTTCCTTCTTCTGGTGATCAAGGGGGTTGTACGGTGAGAGATTTAGATATAAACCAAGTACCATCGGGAGGAGCAGAAGATGAATGGATCACAGCAAGCATGGAGGATGAAGAAGAAAGCTGCAATGGAGCCCCTCCTCGCAAAAAACTTCGTCTTACAAAAGAACAGTCTCGCCTTCTTGAAGAAAGTTTCAGACAAAACCATACCCTAAACCCTAAGCAGAAAGAAGCATTGGCTATGCAGCTGAAGCTGAGGCCAAGGCAGGTTGAAGTTTGGTTCCAGAACCGTAGGGCCAGGAGCAAGTTGAAGCAGACAGAGATGGAGTGTGAGTACCTGAAAAGATGGTTTGGATCACTGACTGAACAGAACAGGAGGCTGCAAAGAGAGGTGGAGGAGCTAAGGGCCATGAAAGTAGGGCCACCAACCGTGATTTCGCCTCACAGCTGTGAGCCTCTCCCAGCATCAACCCTTACAATGTGCCCTCGGTGCGAGCGAGTCACCACCACTGCCCTTGACAAGGGCCCCACCAAAATGACCGCCGCCACCGCCACTGCCACCACATTGTCGTCTAAAGTTGGGACATCGGCCCTCCAATCAAGGCCATCTTCGGCGGCTTGTTAG
Protein:  
MAVLPTGSSNLELTISVPGFSSSPSLPSSGDQGGCTVRDLDINQVPSGGAEDEWITASMEDEEESCNGAPPRKKLRLTKEQSRLLEESFRQNHTLNPKQKEALAMQLKLRPRQVEVWFQNRRARSKLKQTEMECEYLKRWFGSLTEQNRRLQREVEELRAMKVGPPTVISPHSCEPLPASTLTMCPRCERVTTTALDKGPTKMTAATATATTLSSKVGTSALQSRPSSAAC